The MD-join: An Operator for Complex OLAP
نویسندگان
چکیده
OLAP queries (i.e. group-by or cube-by queries with aggregation) have proven to be valuable for data analysis and exploration. Many decision support applications need very complex OLAP queries, requiring a fine degree of control over both the group definition and the aggregates that are computed. For example, suppose that the user has access to a data cube whose measure attribute is Sum(Sales). Then the user might wish to compute the sum of sales in New York and the sum of sales in California for those data cube entries in which Sum(Sales) > $1,000,000. This type of complex OLAP query is o fen dificult to express and diflcult to optimize using standard relational operators (including standard aggregation operators). In this paper we propose the MD-join operator for complex OLAP queries. The MD-join provides a clean separation between group dejinition and aggregate computation, allowing great flexibility in the expression of OLAP queries. In addition, the MD-join has a simple and easily optimizable implementation, while the equivalent relational algebra expression is often complex and diflcult to optimize. We present several algebraic transformations that allow relational algebra queries that include MD-joins to be optimized.
منابع مشابه
Efficient Computation of Subqueries in Complex OLAP
Expressing complex OLAP queries using group-by, aggregation, and joins can be extremely difficult. As a result of this, many alternate ways of expressing such queries have been developed by database researchers. The use of nested query expressions (subqueries in SQL), are a natural part of these techniques. Recent work has demonstrated how any nested query expression can be rewritten using alge...
متن کاملOLAP on Complex Data: Visualization Operator Based on Correspondence Analysis
Data warehouses and Online Analysis Processing (OLAP) have acknowledged and efficient solutions for helping in the decisionmaking process. Through OLAP operators, online analysis enables the decision-maker to navigate and view data represented in a multi-dimensional manner. But when the data or objects to be analyzed are complex, it is necessary to redefine and enhance the abilities of the OLAP...
متن کاملOLAP Visualization Operator for Complex Data
Data warehouses and Online Analysis Processing (OLAP) have acknowledged and efficient solutions for helping in the decisionmaking process. Through OLAP operators, online analysis enables the decision-maker to navigate and view data represented in a multi-dimensional manner. But when the data or objects to be analyzed are complex, it is necessary to redefine and enhance the abilities of the OLAP...
متن کاملSpecial issue on DOLAP 2015: Evolving data warehousing and OLAP cubes to big data analytics
• The first paper [1], titled “Categoryand selection-enabled nearest neighbor joins”, is a database processing paper. The paper proposes an efficient algorithm (roNNJ) to compute a nearest neighbor join with grouping attributes and selection predicates. The authors extend the state-of-the-art for which nearest neighbor joins that are not category-enabled must process each category independently...
متن کاملAnswering Multidimensional Queries on Cubes Using Other Cubes
Recently there is an important interest on On-Line Analytical Processing (OLAP) technology. In this context, in order to facilitate complex analysis, data are usually modeled multidimensionally, where multiple hierarchies are associated with the dimensions. These Multidimensional (MD) data structures are called data cubes. In the existing OLAP products, the user interaction is limited to one op...
متن کامل